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THE COMBINATORICS OF OCCAM’S RAZOR 


WILLIAM RALPH 

Abstract. Occam’s Razor tells us to pick the simplest model that 
fits our observations. In order to make sense of his process math¬ 
ematically, we interpret it in the context of posets of functions. 
Our approach leads to some unusual new combinatorial problems 
concerning functions between finite sets. The same ideas are used 
to define a nicely behaved and apparently unknown analogue of 
the rank of a group. We also make a construction that associates 
with each group an infinite sequence of numbers called its fusion 
sequence. The first term in this sequence is determined by the 
rank of the group and we provide examples of subsequent terms 
that suggest a subtle relationship between these numbers and the 
structure of the group. 


1. Introduction 

Given a choice of competing theories, Occam’s razor is the principle 
that directs us to pick the simplest one as the most likely to be correct. 
This widely held rule of thumb is named after William of Occam (1285- 
1349), an English philosopher and logician who wrote that ’’plurality 
must never be posited without necessity” [T]. Occam asks us to look 
through a family of possible models and pick the simplest one consistent 
with some observed data. Consider a mathematical form of Occam’s 
process in which the family of models is a set A of functions on a set 
X, the simplicity of a model is relative to some partial order on A 
and the data we are given are the values of some unknown function 
/ G A on S' C A. The key issue is how to decide if and when this data 
determines a unique function / G A. We make this decision as follows. 

Definition 1. Let (A, <) be a partially ordered set of functions on a 
set X. For S C A and f & A , we say f is Occam on S or S is Occam 
for f if f is the least element of {g & A \ g\s = f\s }■ 

In the case where the partial order is equality, we are familiar with 
many examples where functions are completely determined by their 
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values on particular sets. Entire functions are Occam on any open set. 
Linear transformations are Occam on any basis. Polynomials of degree 
at most 3 are Occam on any four element set. Families of functions are 
often determined by their values on finite sets, so it will be useful to 
know the size of the smallest set that determines a function uniquely. 

Definition 2. Let (v4, <) be a partially ordered set of functions with 
domain X and let f G A. The radius of f is defined by R{f) = 
minds'! I S C X and S is Occam for f} if this number is finite and oo 
otherwise. Note that if f is the smallest element of A, then the empty 
set is Occam for f so R{f) = 0. 

Here is a simple example that we will shortly generalize in which the 
radius of each function is equal to 2. 

Example 1. Write a function f: {1,2,3} {a,b} as a 3 letter word 

in the letters from the alphabet {a, b} and consider the set of functions 
represented by A = {aba, bab, aab, bba, aaa, bbb\. Then in (A, =) each 
of these functions is Occam on the subset o/{l,2,3} indicated by the 
two underscores. 

In Example [H is it possible to make A larger and still have the radius 
of each function equal to 2? This easily stated question suggests the 
following unusual family of difficult combinatorial problems. 

Problem 1. For fixed m,n and r, what is the maximum possible value 
of |H| where |X| = m, |E| = n and (H, =) is a set of functions from 
X to Y with R{f) < r for all f ^ A. Let Occ{m,n,r) denote this 
maximum value. 

It is straightforward to show that Occ(l,n, 1) = n, Occ(m, n, 1) > 
m(n — 1) and Occ{m, n, m) = n^. Here is an inequality for the general 
case. 

Theorem 1. Occ{m,n,r) is less than or equal to the largest number 
p satisfying the following three conditions for some choice of the non¬ 
negative integers Xi: 

n-m-r 

(1) p=Yl ixi E Xi < W p < 

i=l i=l 

Proof. Let A be any set of functions from an m element set X to 
an n element set Y. Given an r element subset S Y X, dehne an 
equivalence relation on H by / g if f\s = g\s- Suppose that the 
distinct nonempty elements of the partition of A induced by are 
Pi, P 2 ,..., Pk where k <W since there are at most W distinct functions 
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in the restriction of functions in A to S' . There are at most func¬ 
tions that agree with any particular / on S' so |Pj| < If Xi is the 

number of times that the number i occurs in the list |Pi|, IP 2 I, • • •, l-ffcl, 
then |/l| = Y!1=i \Pi\ = YJt ixi- Since k < n^, we must also have 
that Now suppose that Si, S 2 ,..., S'^m^ are all of the 

r elements subsets of X and let Xij denote the number of singleton 
sets in the partition induced by the equivalence relation By our 
assumption that every element in A has radius less than or equal to r, 

we must have — 1^1- follows that for at least one value jo 

of j we must have xij^ > |v4|/and the result follows. □ 

Theorem [T] can be used in conjunction with a computer to hnd upper 
bounds for Occ{m, n, r) for small values of m, n and r. 

Computer Search 1. If we fix m = 3 and r = 2 and consider the 
values n = 2, 3,4, 5 in EquationUl then the corresponding maximum 
possible values of p are 6,15,31,53 respectively. 

It follows that Occ(3,3,2) < 6, Occ(3,3,2) < 15 and Occ(3,4,2) < 
31 but it remains to determine if these upper bounds can actually be 
attained. Here is what is known about the radius 2 case. 

Theorem 2. 

(1) Occ(3,2,2) = 6 

(2) Occ(3,3,2) = 15 

(3) 28 < Occ(3,4,2) < 31 

(4) 4 ( 2 ) -f n < (9cc(3, n, 2) 

(5) 2m < Occ{m, 2, 2) 

Proof. Parts (1) to (4) of this theorem concern Occ{3,n,2) which we 
will bound by generalizing the pattern in Example [U There are ( 2 ) 
different pairs of letters. For each pair {u, u} taken from the n letters 
in the alphabet, we add the four functions uvumuv, miv, vvu for a 
total of 4 ( 2 ) functions. We then add the n constant functions of the 
form u uu for a total of 4 ( 2 ) + n functions which are all Occam on the 
2 underscored positions. It follows that 4 ( 2 ) + n < Occ{3,n,2) and 
parts (1) to (3) now follow from the results of Computer Search [H 
For Part (5), consider the special case where m = 5. The 5 func¬ 
tions aaaaa,abaaa, abbaa, abbba and gfibbb together with the 5 functions 
Mbb, babbb, baabb , baaab and baaaa are all Occam on the 2 under¬ 
scored positions.This pattern generalizes to give the result. 

□ 
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2 . The Radius of a Subgroup 

In this section, we develop Occ(G) for a group G which is analogous 
to the rank of G but has the nice property that if R is a subgroup 
of G then Occ{H) < Occ(G). In order to apply the ideas of the last 
section, we will associate with any group G the set of functions Aq in 
the following way. 

Definition 3. Let G be any group and H be any subgroup of G. Define 
Xh : G —)■ { 0 , 1 } to be the characteristic function of H that takes the 
value 1 at each element of H and the value 0 at elements of G not in 
H. Let Ag denote the set of all characteristic functions of all subgroups 
ofG. 

The elements of Aq for the dihedral group D 4 of order 8 are listed 
on the left hand side Table [U We now use Definition [2] and define the 
radius of a subgroup to be the radius of its characteristic function. 

Definition 4. Let G be any group and H be any subgroup of G. Define 
the radius of H in G or Rg{H) to be the radius of xh in [Aq, =). 

Table [T] shows the radii of the subgroups of D^. These numbers 
capture information about how the subgroup sits inside the group. 

The Radius of Subgroups of D 4 


Xh 

RnfiH ) 

11111111 

2 

10000000 

5 

10000100 

2 

10000100 

2 

10000010 

2 

10000001 

2 

10100000 

4 

11110000 

2 

10101010 

3 

10100101 

3 


Table 1 . The radii of the subgroups of D4 found by a 
computer program. 


Theorem 3. If G is any group, then Rg{G) is the rank of G. 

Proof. Suppose that rank of G is n and that S = {gi, g 2 ,..., Qn} gen¬ 
erates G. S must be Occam for xg because if xh = Xg on S then 
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H contains S which implies H = G and therefore xh = Xg- If fol¬ 
lows that R{xg) < Now suppose by way of contradiction that 
R{Xg) = m < n and that T = {gi,g2, ■ ■ ■ ,5'm} is Occam for xg- H 
H =< gi, g2, ■ ■ ■ ,gm > then xh = Xg on T. But H must be a proper 
subgroup of G which means xh 7^ Xg and T is not Occam for xg- D 

We will be particularly interested in the radius of the identity sub¬ 
group of a group G which we will write as Occ{G). 

Definition 5. If G is any group, define Occ(G) = i?G'({e}). 

Values of Occ(G) are shown for various groups in Table [2] and we see 
that these small groups are classihed by their order, rank and Occ{G). 


The Radius of the Identity for Various Groups 


Group G 

Order 

Rank 

Occ{G) = RcHe}) 

Z2 

2 

1 

1 

Zz 

3 

1 

1 

^4 

4 

1 

1 

Z2 X Z2 

4 

2 

3 

^5 

5 

1 

1 

^6 

6 

1 

1 

^3 

6 

2 

4 

Z 7 

7 

1 

1 

^8 

8 

1 

1 

Q 

8 

2 

1 

Z2 X .^4 

8 

2 

3 

D, 

8 

2 

5 

Dicylic 

12 

2 

2 

Z2 X Zq 

12 

2 

4 

A, 

12 

2 

7 

De 

12 

2 

8 


Table 2 . The values of Occ{G) found by a computer program. 


Unlike the rank, Occ has the nice property that if iif is a subgroup 
of G then Occ{H) < Occ{G) as we now show. 

Theorem 4. If H is a subgroup of G then Occ{H) < Occ{G). 

Proof. It is enough to show that if U is Occam for X{e} in (^g, =) then 
U nH is Occam for X{e} in {Ah, =); the relationship between the radii 
will then follow because \U H H\ < \U\. Now U is Occam for X{e} in 
{Ag, =) means that if K is any subgroup of G with U K = U {e} 
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then K = {e}. To see that UnH is Occam for X{e} in =), suppose 
that L is any subgroup of H and that {U D H) D L = (U O H) D {e}. 
This equality simplihes to UHL = U n{e} which implies L = {e} since 
U is Occam for X{e} in =)• D 

Using Theorem 0] and Table [2] we can verify the well-known rela¬ 
tionships that Z 2 X Z 2 cannot be a subgroup of the Quaterion group 
of order 8 or the Dicylic group of order 12 and also that S 3 is not a 
subgroup of the Dicylic group. 

Questions 1. What is the smallest example of two nonisomorphic 
groups with the same order, rank and Occ? How do we characterize 
the groups G with Occ{G) = 1 which include the cyclic groups and the 
Quaternion group of order 8 ? 

3 . The Fusion Sequence of a Group 

We will make a construction that allows us to associate with any 
poset of functions {A, <) an inhnite sequence of positive integers called 
its fusion sequence. The first step is to identify some special subsets of 

A. 

Definition 6. Let {A, <) be a partially ordered set of functions on a 
set X and let S be a subset of A. Define the fusion set of S to be the 
set Fs of all functions f ^ A that are Occam on S. 

Give a function / in some poset of functions, we will want to know 
the size of the smallest fusion set containing /. We call this number 
the fusion number of /. 

Definition 7. Let (^, <) be a partially ordered set of functions on a 
set X and let f G A. Define the fusion number of f to be mindF^I | 
f E Fs} if finite and infinity otherwise. 

The next construction starts with a poset of functions {A, <) and 
uses its fusion sets to build a new poset of functions on the set A. 

Definition 8. Let {A, <) be a partially ordered set of functions on 
a set X. Define an infinite seguence of partially ordered sets {A, < 
)i 5 {A, <) 2 ,..., called the ascendents of {A, <), as follows. The first 
ascendent of {A, <) is {A, <)i = ({xfsI*S' C X}, -<) where -< orders the 
characteristic functions by subset inclusion and the domain of each of 
the functions xfs is the set A. Now define {A, <)2 = ({xfsI*S' C X}, -< 
)i and continue inductively. 

We note that xa is the maximal element of {A, <)i which implies 
that every ascendent of {A, <) has a maximum element. The radii of 
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these maximal elements are the terms in the fusion sequence of {A, <) 
that we now dehne. 

Definition 9. Let (A, <) be a partially ordered set of functions on a 
set X. Each ascendent of {A, <) has a maximal element so we let Fn be 
the fusion number of the maximal element of the nth sequent of {A, <) 

. The sequence Fi, F2 ,... will be called the fusion number sequence of 
{A, <). If {A, <) happens to have a maximal element, then we denote 
the fusion number of that element by Fq. 

Within this general context of fusion sequences, we now focus on the 
following poset of functions associated with a group G. 

Definition 10. Let G be any group and H he any subgroup of G. 
Define xh ■ G ^ {0,1} to be the characteristic function of H that 
takes the value 1 at each element of H and the value 0 at elements of 
G not in H. Let {Aq, -<) denote the set of all characteristic functions 
of all subgroups of G ordered by inclusion. In other words, xh -< Xk if 
H CK. 

If G is any group then we can give a precise description of the fusion 
sets. 

Theorem 5. Let G be any group. If S is any subset of G and Fs is the 
fusion set of S with respeet to {Aq, -<), then Fg = {x<i?>l I R ^ S}. In 
other words, Fg consists of the characteristic functions of all subgroups 
generated by all possible subsets of S with the convention that the empty 
set generates the identity subgroup. 

Proof. To see that{x<R>| | F C S'} C Fg, let F C S and suppose 
that Xh agrees with x<R> 011 S. Then xh agrees with x<R> F and 
therefore R H and < F >C if so x<R> is Occam on S. For the 
reverse inclusion, suppose xh ^ Fg. Let F = if fl S which implies 
< R >C H and < F > flS = if fl S. It follows that x<R> Xh and 
that x<R> agrees with xh on S. But by assumption S is Occam for 
Xh so we must have x<R> = Xh- D 

Table [3] shows the hrst few terms in the fusion sequences of various 
groups. Here is a sample calculation for the group G = Z^. 

Example 2. Consider the calculation of the fusion sequence of {Aq, -<) 
in the case where G = Z/^ = {0,1, 2, 3}. In this case, 

Azi = {(1, 0, 0, 0), (1, 0,1, 0), (1,1,1,1)} and the fusion number of 
the maximal element (1,1,1,1) is Fq = 2. 

{Ag, -<)i =({(!, 0 , 0), (1,1, 0), (1, 0,1), (1,1,1)}, -<) and the fusion num¬ 
ber of the maximal element (1,1,1) is Fi = 4. 
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(Ag, ^)2 = ({(1, 0, 0, 0), (1, 1 , 0 , 0), (1, 0, 1 , 0 ), (1, 1 , 1 ,1 )}, and the 
fusion number of the maximal element (1,1,1,1) is F 2 = 2. 


Group 

Fo 

Fi 

F 2 

F 3 

Zp p a prime 

2 

2 

2 

2 

^4 

2 

4 

2 

4 

Z 2 X Z 2 

4 

8 

2 

? 

Ze 

2 

4 

4 

8 

S 3 

4 

16 

2 

? 

Zs 

2 

8 

2 

8 

F4 

4 

64 

? 

? 

Q 

4 

16 

2 

? 

Z9 

2 

4 

2 

4 

Z 3 X Z3 

4 

16 

2 

? 

Zio 

2 

4 

4 

8 

Z 12 

2 

8 

4 

? 

Zi 4 

2 

4 

4 

8 

Z 16 

2 

16 

2 

? 


Table 3 . 


'he first four fusion numbers of (/ 


G5 


with a computer program. 


-<) found 


The values of Fq in Table [3] suggest the following theorem. 

Theorem 6. If G is any group with finite rank n then Fq = 2”. 

Proof. The maximal element of (Tg,-<) is xg and Fq = min{|T 5 '| | 
Xg £ ^s} by DefinitionlHl By Theorem|5l Fs = {x<i?>| | C S'} which 
means Fq = mindF^I | S generates G }. Suppose T = {gi,g 2 , ■ ■ ■ ,gn} 
generates G where n is the rank of G. Then every subset of T generates 
a distinct subgroup of G whose characteristic function is Occam on T 
and therefore Fq < \Ft\ = {x<R>l \ R F T} = 2"-. It remains to show 
that if S is any generating set for G then IT^I > 2"’. For each i, we can 
hnd a finite number of elements of S whose product is gi G T. Together, 
all of these elements of S used to generate the elements of T form a 
hnite generating set of G which must contain a set U that generates 
G and is minimal in the sense that no proper subset of U generates 
G. If Ri and R 2 are two different subsets of U then we cannot have 
< Ri > = < R 2 > because then at least one of the elements of U could 
be expressed in terms of other elements of U contradicting U being a 
minimal generating set. Consequently, the set {< R >\ R F U} must 
consist of precisely 2l^l different subgroups where as usual we take < 
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0 >= {e} . Therefore \Fs\ = \{< R >\ R C ^}| > \{< R >\ R C U}\ 
= 2l^l > 2"', since \U\ > n = rank{G) by the dehnition of rank. □ 

It is well known that the rank of a group is not effectively computable 
in general so it follows from Theorem [6] that the same must be true for 
Fq. On almost no evidence we make the following conjecture: 

Conjecture 1. If G is any finite group then the fusion sequence of G 
is periodic. 

We have used maximal elements and subset inclusion to dehne the 
fusion sequence but there are other possible approaches that we will 
consider elsewhere. 
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